Approaches to Microphone Independence in Automatic Speech Recognition

نویسندگان

Pedro J. Moreno

Uday Jain

Bhiksha Raj

Richard M. Stern

چکیده

This paper describes a series of cepstral-based compensation procedures that render the SPHINX-II system more robust with respect to acoustical changes in the environment. The first algorithm, RATZ (MultivaRiate gAussian based cepsTral normaliZation) requires stereo-data for computing compensation terms, and is similar in philosophy to MFCDCN [ref] (in fact MFCDCN can be thought of as a discrete case of RATZ). We also describe a second algorithm, an improved version of CDCN, that does not require stereo training data and yet achieves performance levels comparable to the RATZ and other stereo algorithms. Use of the various compensation algorithms in consort produces a reduction of error rates for SPHINX-II by as much as 20.0% percent relative to the rate achieved with cepstral mean normalization alone, in both development test sets and in the context of the 1994 ARPA CSR evaluations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptation and Compensation : Approaches to Microphone and Speaker Independence in Automatic Speech Recognition

This paper describes recent efforts by the CMU speech group to address the important problems of robustness to changes in environment and speaker. Results are presented in the context of the 1995 ARPA common Hub 3 evaluation of speech recorded through different microphones at different signal-to-noise ratios (SNRs). For speech that is considered to be of high quality we addressed the problem of...

متن کامل

Signal Processing for Robust Speech Recognition

This chapter compares several di erent approaches to robust automatic speech recognition. We review ongoing research in the use of acoustical pre-processing to achieve robust speech recognition, discussing and comparing approaches based on direct cepstral comparisons, on parametric models of environmental degradation, and on cepstral high-pass ltering. We also describe and compare the e ectiven...

متن کامل

Automatic Speech Recognition of Human-Symbiotic Robot EMIEW

Automatic Speech Recognition (ASR) is an essential function of robots which live in the human world. Many works for ASR have been done for a long time. As a result, computers can recognize human speech well under silent environments. However, accuracy of ASR is greatly degraded under noisy environments. Therefore, noise reduction techniques for ASR are strongly desired. Many approaches based on...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2003

Approaches to Microphone Independence in Automatic Speech Recognition

نویسندگان

چکیده

منابع مشابه

Adaptation and Compensation : Approaches to Microphone and Speaker Independence in Automatic Speech Recognition

Signal Processing for Robust Speech Recognition

Automatic Speech Recognition of Human-Symbiotic Robot EMIEW

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

عنوان ژورنال:

اشتراک گذاری